OPERATIONAL DISTANCE AND FIDELITY FOR QUANTUM CHANNELS 



VIACHESLAV P. BELAVKIN 1 '*, GIACOMO MAURO D'ARIANO 2 ' 3 ^, MAXIM RAGINSKY 3 t 



ABSTRACT. We define and study a fidelity criterion for quantum channels, which we term 
the minimax fidelity, through a noncommutative generalization of maximal Hellinger dis- 
tance between two positive kernels in classical probability theory. Like other known fideli- 
ties for quantum channels, the minimax fidelity is well-defined for channels between finite- 
dimensional algebras, but it also applies to a certain class of channels between infinite- 
dimensional algebras (explicitly, those channels that possess an operator-valued Radon- 
Nikodym density with respect to the trace in the sense of Belavkin-Staszewski) and in- 
duces a metric on the set of quantum channels which is topologically equivalent to the 
CB-norm distance between channels, precisely in the same way as the Bures metric on the 
density operators associated with statistical states of quantum-mechanical systems, derived 
from the well-known fidelity ('generalized transition probability') of Uhlmann, is topolog- 
ically equivalent to the trace-norm distance. 
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1. Introduction 

Many problems in quantum information science 1 1 2 1, both in theory and in experiment, 
involve finding a set of quantum-mechanical states or channels that solve some sort of an 
optimization problem, typically formulated in terms of a numerical criterion that measures 
how close a given pair of states or operations are to each other. (Many such criteria have 
been proposed to date, each defined with specific theoretical or experimental considerations 
in mind; see Ref. [3 1 for a recent comprehensive survey.) 

Let us first consider the case of quantum states, i.e., density operators. Let f) be a 
complex separable Hilbert space associated to a quantum-mechanical system. Given a pair 
of density operators p, a, i.e., positive trace-class operators with unit trace, one can use 
either the fidelity 0|5]|6||3 

F(p,a):=Tr[^W /2 ) 1/2 ] (D 
or the trace-norm (half-) distance 

I?(p,o-) :=|||p-o-|It» ( 2 ) 

where ||p|| T := Tr |p|, \p\ := (p^p) 1 ^ 2 |8][9). Loosely speaking, two states p and a are 
close to each other if F(p, a) is large, or if D(p, a) is small. In fact, as follows from the 
key inequality I5"l ll0l 

1 - F(p, a) < D(p, a) < y/\ - F»(p, a), (3) 

the fidelity and the trace-norm distance are equivalent in the sense that any two density 
operators that are close to one another in the sense of Q are also close in the sense of (|2ji, 
and vice versa. 

As for quantum channels, i.e., normal completely positive unital mappings from an 
operator algebra B — B(r)) into another algebra A — B (g), where g and t) are complex 
separable Hilbert spaces, things get somewhat complicated. Consider, for instance, the 
case when g is finite-dimensional, and let m := dimg. Fix an orthonormal basis {|j)}S=i 
of 0, and let :— mT 1 ! 2 Y^jLi \J) ® \J) ^ e me normalized maximally entangled vector 
in the product space g (8fl. Given two quantum channels <£>, W : B — ► A, one can measure 
their closeness in terms of the fidelity of the states on B <£> A, obtained from the maximally 
entangled state it = \ip) (ip\ by applying the predual channels <I> T and ^ T (cf. Section|2]for 
precise definitions) to the first factor in the tensor product: 

m m 

d> T ® id(Tr) = - E E $ t ® l<X*l = P 

»=l fe=l 

-.mm 

$ g, id(7r) = _ ® = v- 

i=i k=i 

The fidelity F (p, a), taken as the channel fidelity 

J c '($,*) := J F 1 ($ T ®id(vr),* T ®id(7r)) (4) 

by Raginsky in Ref. 1111 . enjoys many properties parallel to those of the fidelity Q for 
quantum states. Alternatively, one can adopt the (half-) distance lfT|ll2l fT3l 

©($,*):= i||*-*||cb. (5) 

where || • || c b denotes the so-called norm of complete boundedness (or CB-norm for short; 
cf. Section 1231 for details). We note that the CB-norm half-distance (|5} can be given in 
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terms of the trace-norm distance between density operators by means of the variational 
expression fT1 fT2l[T3l 

= supD($ T (g)id(7r),^ T (g)id(7r)), (6) 

where the supremum is taken over all density operators tt on the tensor product space g ® q. 
By analogy with density operators of the states, we are tempted to say that two quantum 
channels, $ and ^, are close either if .F(<I>, is large or if 2?($, vp) is small. However, 
in addition to the finite-dimension restriction dim g < oo [the only case under which the 
definition @ of the channel fidelity makes sense], we encounter the following difficulty. It 
turns out 11 II that, as a criterion of closeness, the CB-norm distance (|5} is strictly stronger 
than the fidelity measure @ in the sense that even when V P) is large, J-($>, may be 
quite large as well, and may even become equal to one in the limit dim g — > oo. Consider, 
for instance, the case = id. Then one can show 1 1 1 1 that 

1 -£>($, id) < J"($,id) < y/l - (l/4)X> 2 ($,id), (7) 

and we immediately see that when $ is such that T)(Q, id) attains its maximum value of 
unity, the fidelity !F($, id) is still bounded between and \/3/2. To make matters worse, 
the only bound on (|5} in terms of (0} known so far is 

1 >£>($,*) > l-J-($,*), (8) 

as follows readily from Eqs. l|3} and l|6}. Furthermore, one can easily find sequences {$,„}, 
of channels $ m , <J m : B(C rn ) — > B(C rn ), such that D($ m , * m ) ^ for aU m, 

while 

lim ^($ m ,* m ) = 1. 

m — >oo 

Indeed, consider the unitarily implemented channels 

$ m (B) = UlBU m , * (B) = V r \BV m 
with the unitaries U m , V m chosen in such a way that U m ^ V m but 

lim -Tr(UlV m ) = l. 

m — ^oo jji 

Thus, the channel fidelity apart from being applicable only in finite-dimensional set- 
tings, has the distinct disadvantage of not being equivalent to the cb-norm distance, in 
contrast to the case of the Uhlmann fidelity Q and the trace-norm distance (0 on the state 
space of a quantum-mechanical system. 

The goal of this paper is to define and study a new fidelity criterion for quantum chan- 
nels, which we term the minimax fidelity and which is a noncommutative generalization of 
maximal Hellinger distance between two positive kernels in classical probability theory. 
Unlike the channel fidelity @ of Ref. II II . the minimax fidelity is not only well-defined 
for channels between finite-dimensional algebras, but also applies to a certain class of 
channels between infinite-dimensional algebras (explicitly, those channels that possess an 
operator-valued Radon-Nikodym density with respect to the trace in the sense of Belavkin- 
Staszewski [ 14 1) and is equivalent to the CB-norm distance, echoing the way the Uhlmann 
fidelity Q for density operators is equivalent to the trace-norm distance (0. 

Apart from these technical features, the minimax fidelity /($, *&) between two quan- 
tum channels $, \p has a direct operational meaning: intuitively, it is defined as the mini- 
mum overlap of output states (density operators) of the predual channels $ T , when the 
operator-sum decompositions 1 2 1 of the latter are chosen to be maximally overlapping; this 
is spelled out in precise terms in Section l4~5l Our central result (Theorem[0 demonstrates 
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that the minimax fidelity is independent of the order of these two optimizations. Further- 
more, the equivalence of our minimax fidelity to the CB-norm distance, which is stated 
precisely in Section [5] in terms of dimension-free bounds, is a promising avenue for the 
study and characterization of dimension-free bounds (whenever they exist) on other opera- 
tionally meaningful distance measures for quantum operations Q in terms of the CB-norm 
distance. As pointed out in Ref. 1 15 1, such bounds are crucial for a successful generaliza- 
tion of the usual quantum capacity of a channel [l] |2| (i.e., with respect to the identity 
channel) to the case of comparing quantum channels to an arbitrary reference channel. We 
plan to pursue these matters further in a future publication. 

The paper is organized as follows. In Section|2]we fix the definitions and notation used 
throughout the paper. The minimax fidelity is then introduced in Section [5] Section 0] is 
devoted to the evaluation of the minimax fidelities in the various mathematical settings 
that arise in quantum information theory. Next, in Section [5] we list key properties of 
the minimax fidelity. Finally, in Section [6] we sketch some example applications of the 
minimax fidelity to several problems of quantum information theory. 

2. Preliminaries, definitions, notation 

2.1. Pairings, states, operations. Let h be a complex separable Hilbert space; let B de- 
note the Banach algebra Z?(h) of all bounded linear operators on f) with the usual operator 
norm || • ||; and let B T denote the Banach space B T (t)) of trace-class operators on fj with 
the trace norm || • || T . The set of normal states on B, i.e., ultraweakly continuous positive 
unital linear functionals on B, will be denoted by S(B) or, whenever we need to exhibit the 
underlying Hilbert space explicitly, by 5(h). Generic elements of S(B) will be denoted 
by the stylized Greek letters w, g, ^. Note that the operator norm on B can be written as 
\\B\\= S wp{ g (\B\):geS(B)}. 

We equip f) (and shall equip all Hilbert spaces introduced in the sequel) with an isomet- 
ric involution J = ,P , J 2 = If,, having the properties of complex conjugation, 

J X! 'V/,. = Yl A ••■''/;• VAj S C, //, S h. 

i j 

We can thus define the transpose of any B e B as B := JB^ J, as well as introduce the 
trace pairing 1 16) 

(B, p) := Tr(Sp) = Tr(Bp), VB € B, p € B T (9) 

of B and B T . Under this pairing, which differs from the usual one in that B 6 B is paired 
with the transpose of p 6 B T rather than directly with p, normal linear functionals on B 
are in a one-to-one correspondence with the elements of B T . Thus to each normal state g 
we associate a unique positive trace-class operator with unit trace, denoted by the standard 
Greek letter p and referred to as the density operator corresponding to g, via g(B) = (B, p) 
for all B e B. Similarly, density operators corresponding to states denoted by w and c 
will be denoted by it and a respectively. 

Apart from natural arguments from standard representation theory of operator algebras, 
one reason why we chose to pair B with the transposed operator p — Jp^J, rather than with 
p, is to be able to keep all notations conveniently parallel to the classical (commutative) 
case, as will be amply demonstrated throughout the paper. Note also that we can fix a 
complete orthonormal basis {\j)} of f) and express the pairing (|9) in terms of the matrix 
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elements of B and p as 

(B,p) = $>l£|fc) • (j\ P \k) = J2 B ^ k , 

j,k j,k 

where we have used the covariant indices for the matrix elements of bounded operators in 
B and the contravariant indices for the matrix elements of trace-class operators in B T , when 
the latter are identified via the pairing © with normal linear functionals on B. Yet another 
reason to opt for the pairing of B with the transposed operator p, further elaborated upon 
in Section 12.21 is that then the density operator p of a normal state g will coincide with 
the operational density of g, understood as a quantum operation from B into the Abelian 
algebra C. 

Introducing another Hilbert space g, the algebra A :— B(g) and the trace class A T := 
B t (q), let us consider quantum operations, i.e., the completely positive normal linear map- 
pings $ : B — > A such that $(!(,) < 1 B ; if $(lf,) = l fl , then $ is referred to as a 
quantum channel. Any quantum operation $ possesses a unique predual $ T : A T — ► £> T , 
defined as the transpose of $ with respect to the trace pairing (|9j, i.e., 

($(B),p) = (fl,$ T (p)), VB€B,p€A T . (10) 

Conversely, given a normal completely positive linear map $ : A T — ► B T such that 
Trf, < Tr g p for all p E A T , we define its dual with respect to the trace pairing (|9jl as 
the unique mapping $ T : B — ► A for which 

(B,<f>(p)) = (<t>T(B),p), VBeB, P eA T . (11) 

Using these definitions, one readily obtains that $ J = <£> for any normal completely posi- 
tive map $ : B — ► A. Alternatively, one may define the predual of a normal completely 
positive map $ : B — ► A as the unique normal completely positive map $ T : A T — ► B T 
such that <&j = $. 

If $ is given in the Kraus form fT71 $ (B) = ^F^BFj, or more generally as an 
integral 

$(B)= / F( z y BF(z)dp(z), (12) 
J z 

with respect to a positive measure p on a measurable space (Z, Bz), where the integration 
is understood in the sense of Bochner 11 81 . then the predual map <£> T has the transposed 
integral form 

* T 00= / F J (z) t pF J (z)dp(z), 
Jz 

where 9^ i — > (£,\Fj (z) are Hilbert-transposed to the operators f) 9 r\ i — > (r/\F (z), 

that is Fj (z) = F~{z) for all zeZ. 

Any normal state g £ S(B) is automatically a quantum channel from B into the Abelian 
algebra C, and it is readily seen that the density operator p of g, understood as acting on 
A € Con the right, C 9 A i — ► Xp, is precisely the predual g T : C — ► B T . Indeed, given 
B 6 B and A € C, we have 

(g(B),X) = (B,Xp) = (B,g T (X)), 

which proves our claim that p = g T . Thus we also have that g = p| = p T . 
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2.2. Operational densities. In order to avoid technicalities involving unbounded opera- 
tors, we shall henceforth assume that all quantum operations we deal with are completely 
majorized by the trace, considered as the map r (ct) = l fl Tr a of £> T into A = B (g), in the 
sense 1 14 1 that there exists a constant A > such that the difference At — $ is a completely 
positive map B T — > A. For example, this condition is satisfied by all quantum operations 
between finite-dimensional algebras 1 19 1. As was proven in 1 14 1, in this case there exists a 
unique positive operator $ T on the Hilbert space 7i := Q ® h, called the density of $ with 
respect to the trace r, such that 

$GB)=TH [(l fl ®B)$ T ], (13) 

where Trf, 7, $(7i), denotes the partial trace of F with respect to h, 

(Tr„r,p) = (y,p®ll„), V/>e£ T (g). 

Moreover, <£> T as a linear operator on H is bounded and majorized by A: < $ T < Al^, 
and the operation is unital, ^(llf,) = l s [contractive, ^(li)) < 1 8 ] if and only if Tr;, $ T = 
l fl (TYf, <£> r < l g ). This is equivalent to saying that the predual map $ T : A T — ► £> T , 
which, using Eqs. dlOt and (1131 . can be written as 

$ T (p)=Tr fl [$ T (p®l 6 )], (14) 

is trace-preserving (trace-decreasing). 

As an example, consider a normal state g on B, which, being a quantum channel into C, 
satisfies the complete majorization condition with A = ||p||, where p is the density operator 
of g. Furthermore, it is easy to see that g T = p. Indeed, we can write 

g(B) = (B,p) = Tv(Bp) = Tr(Bp) = Tr„ [(l c ®5)p], 

and the desired result follows upon comparing this with Eq. Jl 31 . This provides additional 
justification for our definition of the trace pairing in Eq. (|9j, since we then have that £ T = 
p = q t for any normal state g. 

If the operation $ : B — ► A is given in the generalized Kraus form dl 2i . we can write 
down its operational density $ T explicitly. To this end, suppose that all operators F (z) 
are determined by generalized bra-vectors T (z) = (F (z) |, densely defined as the linear 
functionals 

T(z)\t®n) = (£\F(z)\ri) = {F{z)\\Z®Tjj) 

on the linear span of the ket-vectors |£ ® rj) = £ ® rj in TL = g ® h, where ^ € g is also 
treated as a bra-vector such that J£ = (£| and |^) = ^. Then the operational density <3? T of 
$ is given by the corresponding decomposition 

*r = J r r (z) (z) = r+r, (15) 

where the integral is, again, understood in the sense of Bochner. 

2.3. Completely bounded maps. Completely positive linear maps between operator al- 
gebras are a special case of completely bounded maps |20|. Consider, as before, the alge- 
bras B = B(\)) and A = B(q). For each n £ N define the nth matrix level Ai n (B) ~ 
B ® A4 n , where A4 n denotes the algebra ofnxn matrices with complex entries. That is, 
A4 n (B) is the space ofnxn matrices with S-valued entries, 

M n (B) := {[Bij] : B %1 €BA<i,j< n} . 
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Analogous construction can also be applied to A to yield the matrix levels M n {A). Each 
matrix level M n {B) inherits a *-algebra structure from B through 



[Bij][C i: 



y ] BikCkj 
,fe=i 



In fact, by identifying M n (B) via a natural *-isomorphism with the algebra B(l}^) of 
bounded linear operators on r/™), the direct sum of n copies of f), one can make M. n {B) 
into a C*-algebra. Thus, each matrix level of B possesses a unique C*-norm. 

Now, for any n € N a linear map A : B — ► A induces the map A(") := A ® id n from 
M„(B) into M n (A), defined by A (n) : [B i3 ] i — > [A(Bij)]. Let us define the norm of 
complete boundedness (or CB-norm) by ||A|| c b := sup{||A( n )|| : n e N}, where 

l|A (n) || := sup \\\ {n) (B)\\ 

BeM n (B),\\B\\<l 

is the usual operator norm of A^™) . A linear map A : B — > A is called completely bounded 
if ||A|| c b < oo. Every completely positive map $ : B — ► A is automatically completely 
bounded, with ||$|| c b = ||$(lfj)||. For a general completely bounded map A, one has, by 
definition, ||A(1„)|| < ||A|| < ||A|| cb . 

Passing to the predual map A T : A T — ► B J7 we can similarly define induced maps 
A^ : M n {A T ) — > M n (B T ), n e N, and the predual CB-norm 

||A||J b :=sup||AW|| T , 

n£N 

where 

||A(")|| T := sup ||A(")(p)|| T . 

P eA4„(^ T ):||p|| T <l 

It is easy to see that ||A( n )|| = ||A T n) || T for all n e N, so that ||A|| cb = ||A T ||J b . It 
is also straightforward to see that the "unstabilized" norms || • || and || • || T are tensor- 
supermultiplicative (i.e., 1 1 Ai <g> A 2 1| > ||Ai|| ||A 2 ||), whereas the corresponding CB-norms 
are tensor-multiplicative (i.e., ||Ai <S> A2 || c b = II Ai || c b|| A2 || c b)- 

There is also a useful non-variational formula for the CB-norm of a map A : B — ► A. 
Namely, let £ 2 denote the Hilbert space of square-summable infinite sequences of complex 
numbers, and let JC(t 2 ) denote the space of compact operators on I 2 . Then ||A|| c b = 
|| A ® id^(^2j ||. Since we have assumed that B = B(f)) with h a complex separable Hilbert 
space, and since all complex separable Hilbert spaces are canonically isomorphic to £ 2 , we 
may also write ||A|| cb = ||A <S> id K(f ,)||. 

2.4. Miscellany. Any positive operator B e B(t)) has a unique positive square root, de- 
noted by B 1 / 2 and defined as the positive operator X € £>(()) such that B = X 2 . This 
definition can be extended to any operator A that is similar to a positive operator A e B(i)), 
in the sense that there exists an operator S G B(t)) such that A — SAS + , where S + is 
the pseudoinverse of S, equal to S^ 1 on ran S and to on kerS 1 . In that case, we may 
define y/J := S'A 1 / 2 5+. From now on, in order to distinguish this extended definition of 
the square root from the usual one, we shall always use the square root symbol y/i for this 
extended definition, and reserve the exponent notation • 1 / 2 for the usual definition. 

Consider now two positive operators A, B e B(h). It is easy to see that their product 
AB is similar to A 1/2 BA 1 I 2 with S = A 1/2 . Note that the operator AB is positive when 
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restricted to the closure of ran A, when the latter is equipped with the weighted inner 
product (v\x)a ■= (A- 1 / 2 v\A- 1 / 2 x)-- 

(v\ABv) A = (A- 1/2 v\A- 1/2 ABv) = (v\Bv) > 0, € vmA. 

Thus we may define y/AB := S{A 1 / 2 BA 1 ' 2 )S + with S = A 1 ' 2 . 

This notation, again, allows for a convenient parallelism between the classical (commu- 
tative) formalism and the quantum (noncommutative) one. Indeed, consider two mutually 
commuting positive trace-class operators p, a, let {\x}} denote the set of their common 
eigenvectors, and let p x = (x\p\x),a x = (x\a\x) denote the corresponding eigenvalues. 
Then ^fpa is also trace-class, and 

Tr ^fpa = ^ 

X 

If Trp = 1 = Trcr, then P := {p x } and Q := {<r x } are probability distributions, and 
Tr ^fpa then gives the classical fidelity (also known as the Bhattacharyya coefficient) 1 1 1 
F(P 7 Q) between P and Q. 

Our main technical tool in this paper is given by the following: 

Lemma 1. Let Ti be a complex separable Hilbert space, and let R, S £ B(Ti) be positive 
operators such that R}/ 2 SR}/ 2 is trace-class. Then the supremum 

sup {T*(Xty + ytx) :X^X = R,Y^Y = S} =2Ti VRS (16) 

is achieved on any X G B(7i) satisfying the condition X^X = R, say X = R 1 / 2 , and 
Y = Y satisfying the equation 

Y Q X^ = {XSX^) 1/2 = XY%. (17) 

Proof. To prove the lemma one can use either the polar decomposition or the method of 
Lagrange multipliers. We shall use the latter. Fixing an X satisfying X'X = R, we can 
write the Lagrange function as 

L = TrpftY + Y^X - Y^YL), 

where L = L' € B(H) is the operator- valued Lagrange multiplier corresponding to the 
hermiticity condition S = Y^Y = . At the stationary point 

81 = Tr(A"t - LY^)5Y + (X - YL)SY^ = 0, 

so Y = Y a must satisfy the equation YL = X (the other equation, LY^ = X', corre- 
sponding to Y"t = YJ, is obtained by taking the Hermitian adjoint). Thus Y Q — XL^ 1 , 
where L^ 1 should be determined from L~ 1 X' r XL~ 1 = S. Multiplying this on the left 
by X and on the right by X* yields {XL^X^) 2 = XSX\ or XL^X^ = {XSX^) 1 ' 2 . 
Thus, we indeed have that Y Q X^ = {XSX^) 1 ' 2 = XY%, and therefore that 

Tr(F X t + XY%) = 2Tr((XSX f ) 1/2 ). (18) 

This extremal value is precisely the maximal value due to convexity of the function being 
maximized in Eq. (QTj). Note that, since (WXSX^U) 1 / 2 = WiXSX^) 1 /' 2 !! for any 
unitary U, the value of the supremum in Eq. dl6> . which coincides with Eq. (1 1 81 , does 
not depend on the choice of X satisfying X'X = R. Indeed, by virtue of the polar 
decomposition X = UR 1 ^ 2 , 

2Tr(XSX^) 1/2 = 2Tr(U J <(XSX^ 1/2 U) = 2 Tr ({R 1/2 SR 1/2 ) 1/2 ) . 
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Rewriting this trace in the equivalent form 2Tr(X 1f Y ) with 

X^Y = R l / 2 {R l / 2 SR l ' 2 ) l / 2 R- 1 ' 2 = VRS 

corresponding to X — i? 1 / 2 , we obtain the extremal value in Eq. dl6l . □ 

We shall also need the following simple, but useful, result: 

Lemma 2. Let S be a compact subset of a complex Banach space V, such that x G S 

implies Xx £ S for all A £ C with |A| = 1. Let f : V > C be a continuous function 

which is homogeneous of order 1, i.e., /(Ax) = Xf(x) for all A £ C and all x £ V. Then 

sup \f(x)\ = sup Re f(x). (19) 

Proof. Let a;* £ She such that \f(x*)\ = sup xeS \f(x)\, with/(x*) = \f(x*)\e im ' sf ( x '\ 
Let x** := e~ i&Isfl - x '^x*. By the homogeneity of /, 

f( x **) = e- iars/(;E * ) /(^*) = |/0*)|. 

But then \ f(x**)\ = f(x**) = Ref(x**). Since Re A < |A| for all A £ C, the lemma is 
proved. □ 



3. Operational fidelities and distances 



3.1. Classical kernel fidelity. The fidelity distinguishing different quantum operations 
without the restriction on the Hilbert space dimensionality was suggested by Belavkin 
in Ref. 1211 on the basis of a noncommutative generalization of the maximal Hellinger 
distance between two positive kernels. Namely, given a locally compact space X and a 
measure space (Y, By, n), where /i is a positive measure, let us denote by A the algebra 
C(X) of bounded continuous functions on X, and by B T = Cj(Y) the space of absolutely 
/i-integrable complex functions on Y. A positive kernel P is then given in terms of a 
function p(» | •) : Y x X — > R+, such that P x := p{»\x) £ B 1 for all x £ X, while 
P := f Y p(y|»)d/i(y) £ A. Given two positive kernels P and Q, the squared pointwise 
Hellinger distance 



d 2 H (Px,Qx) := \ f (^p{y\x)-^q{y\x)) 2 &ix{y) 



-(p(y\x) + q(y\x)) - yjp(y\x)q(y\x) 



dfi(y) (20) 



is well-defined and finite for each x £ X, so that we can define 

2 



d Z „(P,Q) 



1 

^ xGX 



V^^y-V 7 ^) dfi(y)=\\d 2 H (Px,Q x )\\, (21) 



the last expression indicating the fact that (^(P, Q) is given by the supremum of the 
squared pointwise Hellinger distance (I20> over all x £ X . Note that the squared Hellinger 
distance dj^(P, Q) between two positive distributions P = p(u) and Q = q(u) is the 
minimal mean quadratic distance 

d 2 H {P,Q) = I mf _ \ I \x(y) - Hy)\ 2 My) ■ lx(-)l 2 =p(.),|^(.)| 2 - <n» 



inf <^ / 

2 x ^ec(Y) {J 

1,\{P + Q) 



sup 

V-:|^(.)| 2 = 9 (.) 



v / p(y)Retp(y)dfj,(y), 



(22) 
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where (/, P) = J f (y)p(y)dfi(y) denotes the integral pairing of / 6 C(Y) with P e 
C T (Y). The relative fidelity 



f(P,Q) = sup / y/p(y)^Bi>(v)dn(y) 

i/(l,P)(l,Q) V:|^(.)| 2 =g(.) J 

(i,VPQ) (23) 



of the distributions P and Q is obviously related to the distance i22\ by 



<4(P, Q) + V(l,P)(l,Q)f(P, Q) = [l, -(P + Q)J . (24) 

If P x := p(»|a;) and := q(»|x) are conditional distributions with constant integrals 
(1, P x ) and (1, Q x ), e.g., normalized to unity, this relation also remains valid for the mini- 
mal fidelity 

f(P,Q)= mtf(P x ,Q x ), 
which can alternatively be defined by the minimax formula 

f{P,Q)= inf sup , (25) 

^ x V>:|lK.|aO| 2 =Q.(.) V(l, Px)(l, Qx) 



where the supremum is achieved on tp(»\x) = ip (*\%) satisfying ip(y\x) = \/ q{y\x). In 
particular, if P and Q are probability kernels, (1, Px) = 1 = (1, Qx) for all x E X, then 

d 2 H (P, Q) = 1 - inf / V^M^)d A1 ( 2/ ) ee 1 - f(P, Q), 

x£X J 

where 



f(P, Q) = inf / ^p(y\x)q(y\x)d^y) = inf (1, y/P*Q x ) (26) 
is the minimax fidelity of the classical channels described by these kernels. 

3.2. Quantum operational fidelity. Generalizing Eq. J21i . one can define the squared 
Hellinger distance between quantum operations <£> and "J with the respective operational 
densities $ r , * r e B(H), H = g ® h, as 

<4($,*) = ^ m f {||Tr^(r-T)t(r-T)|| :rtr = $ T ,TtT = * T }. (27) 

The operators T, T e B(W), such that r+r = $ T and T+T = * T , are naturally thought of 
as the purifications of <& r and respectively. This means that we can fix an orthonormal 
basis {\j)} of H, say the product basis \j) = \i) gj = |i, fc), where {\i)} and {|fc)} are 
some fixed orthonormal bases of g and f) respectively, and represent any such T and T as 
strongly convergent sums 

r = J2 UXiir = £ bWI, T = £ U>0'|t = £ liX^-l. (28) 

where the generalized bra- vectors (Fj\ define the bounded operators Fj,Vj ■ g — > f) 
through 

(fcl^li) = (FjlQi) ® |fc)) = (j|r|z,fc), (Al^li) = (V*|(|i) ® |fc» = 01T|z,fc). 
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As seen directly from this definition, the mapping (F\ i — > F is linear: (aF + bG\ i — ► 
aF + bG. Using Eq. (1281 . we may write 

*r = I^K^-I = r f r, * r = WW = t^t, (29) 

where the sums converge in the strong operator topology. This determines the Kraus de- 
compositions G3 $(^) = Ej ^- -B-F). = Ej V j BV 3 of the ma P s * : ^ — ► 
A Analogously, upon defining the mappings F, V : g — ► f) ® by 

:= ^ Fju ® Vv := ^ t> ® |j>, 
i i 

we can write the maps $, in the Stinespring form |22| as $(£>) = F^{B ® tu)F and 
= V T (S® t H )V. 

Taking into account the fact that || j4^|| = sup eg5 ( g ) g(A j( A) and defining the positive 
function 

c(.;«):B(W)xB T (fl)— >R 
c(A;p) :=iTr(A(p®l„)At), 
we can rewrite the fidelity distance (I27> in the following minimax form: 

t&(#, ¥) = inf J sup c(T - T;p) : T+T = $ T , T T T = * T I . (30) 
r,T G B(W) 1 ees(s) 



On the other hand, generalizing Eq. d2Qi to quantum operations, we can define the squared 
pointwise distance 

inf {c(r-T;p) : rtr = $ r ,TtT = * r } (31) 

between $ and 'I' on the set S(g) of all normal states on A = B(g). Just as with the prob- 
ability kernels in the commutative setting described in the preceding section, d^(^, *£) 
coincides with the supremum of d 2 H ($, ^)(g) over all normal states g e S(g) whenever $ 
and W are (proportional to) quantum channels: 

Theorem 1. Let '5 : B — ► A be quantum operations with the respective operational 
densities <£> T , ^ T 6 B{TL). Suppose that for all g € S(q) the pairings 

(^ T , P (g> i„) = e[*(i 6 )], (* r ,p® !„) = e[tt(i 6 )] (32) 

are constant. Then 

dfr(*,*)= sup ^(^^(e). (33) 
ee5( B ) 

Furthermore, then we have that 

1 



+ vMli*ll/(*, *) = 2(II $ H + 11*11)' < 34 > 

vv/iere 

ReTr [$r /2 T(p ® 1k)1 
/(*,*)= inf sup - 1 ) H ^ J (35) 

ee5( fl ) xeB(w):rtT=* T y/Q [$ (lf>)]\/£> [* (*&)] 

i's f/ze minimax fidelity between $ ant/ 'I'. 
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Proof. Fix an arbitrary g 6 S(g). From Eq. d32l > it follows that 

||$|| - sup qMi„)] = e[*(iij)], 

eeS( B ) 

and the same goes for *. Therefore, given any pair T, T G B(TL) such that T^T = <£> T and 
Tt Y = * T , we can write 

c(T-T;p) = iTr((r-Y)t(r-Y)(p®^)) 

= i Tr ((* T + * T )(p <g> 1 6 ) - (T+T + rT+)(p 8) 1„)) 



+ ||vl/|| -Tr[(rtT + rTt)(p®l b )]), 



whence it follows that 

= inf c(r-Y;p) 



r : r f r=* T 

T:T t T=* T 



+ ||*||- ^ sup Tr[(rtY + rYt)( P ®^)] 



2 v r : r + r=* T 

T:T + T=* T 

Taking the supremum of both sides over all g 6 5(g), we obtain 
sup dk($,*)0?) = + ||*|| 



inf sup Tk[(T t T + rT t )(p®l(,)]). (36) 
ee5(g) r . r t r=$T ' 

T:T t T=* T 



On the other hand, 

rfff($,*) = inf sup c(r-Y;p) 
r : r+r=$ T e es( g ) 

X:T t T=* T 



i inf sup (||*|| + 1|*|| -Tr[(rtY + rYt)(p® 1 6 )]Y 



2 r:r+r=* T eGl s( B 

T:T t T=* T 

which yields 

4($,*) = i(||$|| + 1|*||- sup inf Tr[(rtY + rYt)(p®lf,)]). (37) 

T:T 1 'T=* T 

Note that the right-hand sides of Eqs. d36i and d37i differ only in the order of the extrema. 
Thus, establishing the validity of Eq. fl33i amounts to justifying the interchange of the 
extrema. 

According to Lemma [2 the supremum over V and Y in Eq. d36i can be evaluated by 
fixing T = first and then varying only over all Y S B(Ti) such that Y^Y = * T . 

1/2 

By the polar decomposition, any such Y has the form U * T for some partial isometry U. 
Thus we have 

sup Tr^I^Y + TT^)(p ® If,)] = 2 sup ReTr[*y 2 Y(p <g> If,)] 
r : r t r=* T X:Ttx=* T 

T:T t T=1' T 

= 2supReTr[*y 2 ?7*y 2 (p® !„)], (38) 
u 
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where the supremum in Eq. (I38i is taken over all partial isometries U such that 

^1/2^^1/2 _ 

Since the expression being minimized is linear in U and since any partial isometry can 
be expressed as a convex combination of at most four unitaries, we may instead take the 
supremum over the unitary group U(7Y) and, in fact, over the entire unit ball Bi(H) := 
{X G B(H) : \\X\\ < 1}: 

sup ReTr[$y 2 T(/9® = sup ReTr[$y 2 X1<y 2 (><g) (39) 

T:TtT=* T XGBi(W) 

Since the expression being maximized in the right-hand side of Eq. (1391 is affine in both 
X and p, and since Bi(7i) and 5(g) are closed convex subsets of B(H) and B T (g) respec- 
tively, it follows from standard minimax arguments 1 23 1 that we can indeed interchange 
the extrema to obtain /_($,*) = where 

/_($,*) := inf sup Re Tr[$y 2 X*i /2 (p ® If,)] 
ee<S(fl) xeBi(w) 

/+(#,*) := sup inf ReTr[$y 2 X*y 2 (p® If,)], 

which proves the claim of Eq. (1331 . The rest is straightforward. □ 

As seen immediately from Theorem[0 when $ and are quantum channels, then 

+/(*,¥) = 1, 

with the minimax fidelity given by 

/($,*)= inf sup Re Tr [$y 2 T(p® ![,)]. (40) 

4. Evaluating the fidelity distances 

4. 1 . Fidelities for quantum states and quantum effects. Consider two normal states g, c 
on B = B(t)) as quantum channels from B into the Abelian algebra A — B(g) with g ~ C. 
In this case, the operational densities g T , q T of g, c, coincide with the corresponding density 
operators p, a: g T = p and <^ r = a. The predual maps g T , ? T : A T ~ C — ► B T can then 
be thought of as the state creation operations, g T (A) = \p and ^ T (A) = Act for A 6 C. 

In order to compute the minimax fidelity f(g, we have to consider all x,i J ^ B that 
give the decompositions p = x X an d 17 = ipty- Note that we can always write these 
decompositions as purifications 

3 3 

where \xj) ■— x\j)> l^j) : — V'li) w i tn respect to a fixed orthonormal basis {| j)} of h. We 
then have the minimum quadratic distance 

d 2 H (g,c.) = - inf sup w[{\ ~ VO f (x ~ VO] 

ipZLB-.ip^ ip — a 

= \ inf Tr [(x - ?A) t (x - VO] , 
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where the last equality is due to the fact that dim g = 1. Expanding the product under the 
trace, we can write 



Tr(p + a)- sup {ReTr(x t i/>) = X f X = P, V'V = CT } 



= 1- sup ReTr(xV) 

ip£B:ip^ */)= cr 



(41) 
(42) 



= (43) 

According to Lemma ^ the supremum in Eq. ( 1421 is attained at any x € satisfying 
the condition x X 

(xctx 1 ) 172 = xi>k 



p, say x = p 1 ^ 2 , and ^ = ip satisfying the equation VoX 



sup ReTr(x'V') 



sup {ReTr(p 1/2 V) : = o"} 



Observe that the standard Uhlmann fidelity between the density operators p and a, F(p, a) 



Thus the minimax 



in Eq. 0, can be written as F(p,a) = Wp 1 / 2 ^ 1 / 2 ^ = Tr 
fidelity between two normal states g and q on B, or, equivalently, between the state creation 
operations g T , q T : C — ► Bj(t)), agrees with the Uhlmann fidelity between the respective 
density operators p and cr of g and 

Next we turn to the other extreme case, namely that of the state annihilation operations 
\& with the preduals $ T (p) — ($ r , p)< ^t(p) — (^t> p)> corresponding to dim h = 1. 
They are completely specified by the effects, i.e., the positive operators $ T , ^ T 6 £>(g) sat- 
isfying < $ r , < l g , which can be purified as in ( 1291 . where Tj — (j\T, Tj = (j\T 
are the bra-vectors corresponding to an othonormal basis {\j)} in g. The squared point- 
wise minimax distance between the state annihilation operations $, "J, or, equivalently 
between the effects <l> r , \& r , on the set S(g) of normal states g = p 1 on B(g) is given by 
the minimum 

= 5 p M , {Tr [(T - T)t(T - T)p] : rtr = 4> r , T^T = * r } 

of the quadratic distance between their purifications T, T € B(g). The solution of this 
problem is likewise given by Lemma[2with R = $ T and S = p^ T P- Thus the optimum 

*)(£>) = i Tr[($ T + ¥ T )p] - Tr \/<Mp*tP) 

is attained at any T € S satisfying the condition T^T = <£> T , say T = $y 2 , and the 



corresponding T = T Q satisfying the equation T pT T 
maximum of this distance over all states, 

= sup <P H ($,9)( e ) 

ee5( fl ) 



x/r P * T(0 rt = r P Tt. The 



sup 

ees( 8 ) 



- Tr[($ 
2 LV 



sup inf 

ee5(fl ) r,TGB( B ) 



* T )p] - Tr v'M^pj 

{Tr[(r-T) t (r-T) (0 ] : rtr 



$ r ,TtT-* T }, 
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is given by the minimax quadratic distance 

£&(*,*) = - inf (||r-T|| 2 :r t r = $ T ,T t T = * r l, 
Hy ' ; 2r,Tes( B )l" 11 ' J 

interchange of the extrema following from standard minimax arguments 1 23 1 and the fact 
that all r, T satisfying, respectively, T'T — <f> T and T^T = ^ T are contained in the unit 
ball of 5(g). 



4.2. Semiclassical fidelity. It is straightforward to extend the formalism of Section 13711 
involving the commutative Hellinger distance between two positive kernels to the case 
of mappings from a set X into positive trace-class operators on the Hilbert space f), i.e., 
p : x £ X i — ► p(x) e £> T (f)) and a : x e X i — ► a(x) G B T (f)) with p(x), a(x) > for 
all x € X. We thus have the pointwise Hellinger distance 

1, 



d%(p(x),a(x)) = \t,-\p(x)+v(z)] \ - yj(t,p(x))(t,a(x))f(p(x),a(x)) 
in terms of the trace pairing (B, p) = Tr (Bp) of B e B = B(\)) and B T = S T (h), where 



f(p(x),a(x))- - Tr^^VM 



V(l,p(as))(l,^(x)) VTrp^IVa^) 

The semi-classical operational distance between p = and cr = cr(») can then be 
defined as 

<4(p,cr) = Bupd(p(af),(r(a;)) = \\d(p(»), a (•)){{. (44) 

When Tr p(x) = 1 = Tr u(x) for all x E X, i.e., when p and cr are classical-to-quantum, 
c-q (or semiclassical) channels, Eq. J44i can be written as d^(p, cr) = 1 — /(p, cr), where 

/(p,cr) = inf Tr^^) 1 /^^)^) 1 / 2 ] 1 / 2 



= inf Tr y/p(x)a(x) 
= inf F(pr»,<r(aO) 
is the minimax fidelity of cr relative to p. 

4.3. Semiquantum fidelity. Next we consider the opposite of semiclassical operations 
— namely, the semiquantum operations which correspond to quantum measurements as 
quantum-to-classical (q-c) channels. Such operations are given as 

$(&)= ^b(y)$ T (y)dp(y) = (b,$ T ) 

on the algebra B = C(Y) of continuous bounded functions b : Y — ► C, where (Y, By, p) 
is a measure space, by specifying the positive operator-valued Bochner /i-integrable func- 
tions $ T : Y — >A = B(q). If 

*(1) = (1 > * T ) = 1„ 
the predual maps A 3 p i — ► $ T (p)(») G C T (F), 

$ T (p)(y) := (#r(w),p) = e[*r(y)], 

define for each input quantum state £> e 5(g) a classical probability density on (Y, 23y, /z), 
that is, they describe quantum measurements by the positive operator-valued measures 
(POVM's) M(dy) = $ T (v)Mv)- 
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In order to avoid technicalities in defining the semi-quantum fidelity distance between 
two q-c channels <&, ^> : B — ► A, we shall assume that $> T (y), ^ T (y) are weakly con- 
tinuous bounded functions on Y. Then the squared distance d%j(&, can be written as 



= inf 

r,T:rtr=* T ,Ttr=* T 



(r(y)~r(y))\r(y)~r(y))dv(y) 



(45) 



where the decompositions T^T = $ T and T^T = fy T are understood in the pointwise 
sense as 

<£ T (y) = r(y)tr(y), Hf T (y) = T(y^T(y), My e Y, 

The infimum in Eq. ( 145 \ is achieved at any T E A ® C T (Y) satisfying the condition 
rtr = $ r , say r(y) = ^(y) 1 / 2 , and the corresponding T = T Q satisfying the equation 

T (y) P r(y)t = [r(y)pv|/ r ( 2; ) /9 r(y)t] 1 /2 = r(y)pT ( tf )+. 
The maximum of this minimal distance over all states, 

= sup / (\ Tr [($ T (y) + * T (y))p] - Tr J $ T (y)(p* T (y)p)) dp (y) , 

is equal to d 2 H ($, '&) = 1 — /(<&, 'P) in the measurement operation case $(1) = 1 B = 
$(1), where 

/(*,¥) = inf f TrJ<f> T (y)(pV T (y)p)dn(y). (46) 



4.4. Operational fidelity formula. Now we can easily evaluate the minimax formula <|30] 
for the fidelity of two general quantum operations $, : B — ► A, B = B(h), A = B(g) 
The solution of this problem is also given by Lemma [2 with R = $ r and S = (p (g 
l f) )^I> T (p (gi If,). For a given g E S(g), the supremum in 



2 

2 sup {ReTr^T^®!,,)] :r t r = $ T ,T t T = * r }) 

r,xeB(w) ' ' 7 



is equal to Tr y/Q T [(p® IfO^rCp® f^)]> an d i s achieved at any T G satisfying 
the condition r^T = <l> r , say T = , and the corresponding T = T Q satisfying the 
equation 

T Q {p® if,)rt = [r(p® i 6 )* T (p® i(,)r f ] 1/2 = r( P ® i 6 )Tj. 

When '5 are quantum channels, or, equivalently, when the preduals $ T , ^ T are trace- 
preserving, Theorem^says that the maximum of this distance over all states, 



sup Tr -(* T + * r )(p®l ft )-V*r[0»®l| ) )*rO'®lft)] , (47) 
can be written as d 2 H (§, = 1 — /($, where 



/($,#) = inf W$ r [(p®l 6 )# r (p<8>l 6 )] (48) 



is the minimax fidelity between $ and "J. 
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4.5. Operational fidelity in terms of Kraus and Stinespring decompositions. Con- 
sider, as before, two quantum channels <£>, : B — ► A, where B = £>(h) and A = B(g). 
Given the minimax fidelity 

/($,*) = inf sup ReTr[T^T(p® If,)] 
ees(g) r . r t r=$T 

T:T t T=* T 

inf sup iTr^TO® 
e£S(s) r . r t r=$T 

T:T t T=* T 

between $ and '5, where the second equality follows from Lemma|2] the supremum over 
all r and T satisfying, respectively, T^T = $ T and T^T = fy T can be replaced with the 
supremum over all Kraus decompositions of $ and \&, i.e., over all collections {Fj}, {Vj} 
of bounded operators g — ► h, determined from $ T , ty T via Eqs. d29t and d28i : 



inf sup I V g(FlV/ 



(49) 



/(*,*) 

Just as in the proof of Theorem^ we may restrict ourselves only to those T, T that can be 
written as T = U$ 
^ X J 2 in the form of Eq. d28t as 



1 /2 1 /2 

T = Fw T ' for some unitaries U, V. Thus, if we write $ T ' and 



3 3 

then it follows that, given a unitary U, we can write 



and similarly for T = \AJy 2 . Thus 

/(*,¥) = inf sup ^[^(^(V)] 

f?es(fl) j/eu(«) 1 ~^ 

Turning now to the infimum over all normal states g on A = fi(fj), we may equivalently 
consider all pairs {<£>, /C}, where <p is a normal * -representation of .4 on a Hilbert space JC: 

/(*,*):= inf sup l^/J^OO^ 

Since all normal ^-representations of the full operator algebra B(g) are unitarily equivalent 
to an amplification B i — ► i? (g> 1{ for some Hilbert space J, we can write 



/($, *) := inf sup V (v \Fj(U)^Vj <g> 1, 
u6d«it;HI=l (7eU(W) 1 ~^ ^ 1 

Introducing the vectors \v, \v, £ g ® ? ® 7i, defined by 

\v, $) := ^(Fj (8 (8 \v, *) := ® t t )v <g> 

we obtain yet another form of the minimax fidelity: 

/(*,¥) = inf sup \{v,&\t a9t ®U\v,V)\. 



(50) 



(51) 
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For a fixed v € g <g> t, taking the supremum over {/ is tantamount to taking the supremum 
of | (x\0 1 over all pairs of unit vectors XjCGfl®^®^ sucn that 

which, in conjunction with the standard results on the Uhlmann fidelity Q between density 
operators |6, 7|, finally yields 

/(«,*) = inf F($ T ®id(|w)(w|),* T ®id(|t;)(t;|)) 

«eB®t:||t)||=l 

= inf F($ T ®id(/j),# T ® id(p)) . 

ee5(g®t) 

Note that we may always take t isomorphic to g: 

= inf F($ T ®id(|w)(u|),# T <g)id(|v)(v|)). (52) 
ue_e<g>s , ||u |[=i 

Given some Kraus decompositions {i*}}, {Vj} of $ and ^ respectively, we may define 
the operators 

p t ■■= E F ^ ® b'>> ^ := E ^ ® !•?') 

from g into h (8 W and express $ and * in the Stinespring form = F^(B ® 

= 0(-B <g> (cf. Section|l2). Then we may rewrite Eq. lg9} as 

inf sup|Tr(F / 97 t )| , 

where the supremum is over all F, V : g — ► f) ® 7i giving the Stinespring decompositions 
of $ and respectively. We may, as before, fix F and V, say, by considering the 'canon- 
ical' Kraus decompositions {Fj}, {Vj}, and instead take the supremum over all unitaries 

/($,*) = inf sup |Tr[(lf, ®C/') J PpV t ]| 

= inf sup |Tr[i7Tr|,(F/>V t )]| , 
ee-S(fl) c/eu(H) 

which yields another useful formula 

/(*,*) = inf WTr^FpV^Wj (53) 

for the minimax fidelity between the channels <f>, ^. It is, in fact, not hard to show that 
the right-hand side of Eq. (I53> does not depend on the particular choice of the Stinespring 
operators F, V, as long as we agree to dilate the input Hilbert space fj by the 'canonical' 
auxiliary Hilbert space Ti = g h. 

We note that the constructions of this section are valid more generally for channels given 
in terms of the continual Kraus decompositions 

$(B) = / F{z)^BF{z)d^{z), = [ V{z) i BV{z)dv{z), 

J z J z 
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provided that the measures p and v are equivalent, i.e., absolutely continuous with respect 
to each other. Then Eq. J49I is a special instance of the more general expression 



/($, = inf sup 

ees(s) {F(z)},{v(z)} 



z 



y / dv/dpF(z) t V(z)dp(z) 



where dv/dp is the Radon-Nikodym derivative of v with respect to p, for the case when 
both fi and v are counting measures, dp = dv = 1, on a finite or countably infinite set. 

5. Properties of the operational fidelity 

In this section we establish several key properties of the minimax fidelity between quan- 
tum operations. These properties follow almost immediately from the corresponding prop- 
erties enjoyed by the fidelity Q on density operators: 

(F.l) F is symmetric, F(p, a) = F(a, p), bounded between and 1, and F(p, a) = 1 

if and only if p = a; 
(F.2) F is jointly concave over all pairs of density operators; 
(F.3) F is unitarily invariant, i.e., F(p, a) = F(U pU\ UcrU') for any unitary U; 
(FA) F is monotone with respect to quantum channels: F(^ 1 (p), $ T (er)) > F(p,a) 

for every quantum channel $. 
(F.5) the Bures distance ds{*, •) := v/1 — F(*, •) is topologically equivalent to the 

trace-norm half-distance D(u, •): 

2- 1 ' 2 D{p 1 a) < d B (p,a) < ^D{p,a) 

[cf. Eq. 0]. 

Property (F0, in fact, follows from strong concavity of F Q, i.e., 

F C^2piPi-^2 vm) > ^2 VPivi F (Pi> a i) ( 54 ) 

i i i 

for all < pi, Qi < 1 such that = 1 = 1i- 

Using Eq. (I52> . we can immediately obtain for the minimax fidelity /(•,•) on pairs of 
quantum channels the following analogues of properties (F[0-(F|4j of the fidelity F(», •) 
on pairs of density operators: 

(f.l) / is symmetric, bounded between and 1, and /($, 1 J r ) — 1 if and only if $ = 'I'; 
(f.2) / is jointly concave over all pairs of channels; 

(f.3) / is invariant under both left and right composition with unitarily implemented 
channels, i.e., 

f(Guo $,6^°*) =/($,¥) 

and 

/($oe v ,fo9 v ) =/($,*) 

for any two channels $, ^ : 6(f)) — ► B(f|) and any two unitaries U € U(g), 
y S U(h), where Qu(B) := WBU, and Q v is defined analogously; 
(f.4) / is monotone with respect to both left and right composition with quantum chan- 
nels, i.e., /($o $i, *o $ x ) > /($, and /($ 2 o $, $ 2 ° *) > /($, *) for any 
two channels $, ^ : B — ► A, all channels $i into B, and all channels $2 on A. 
Just as in the case of the fidelity between density operators, the minimax fidelity / pos- 
sesses the strong concavity property 
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On the other hand, deriving for the minimax fidelity / an analogue of property (F|5jl of the 
Uhlmann fidelity F requires a bit more work. To this end, let us consider two channels 
5>, \P : B — > A, B = B(i)), A = B(g). Suppose first that g is infinite-dimensional and 
separable. Then g ~ I 2 , and we can rewrite Eq. d52l as 

/($,*) = inf f ($,®id(|«)(«|),* T ®id(|«)(«|)). 
ues®€ 2 ;|H|=i 

The space £ 2 contains, as a dense subset, the pre-Hilbert space l\ of all infinite sequences 
of complex numbers with all but finitely many components equal to zero. Using this fact 
and the continuity property (F0 of the fidelity F, we obtain 

/(*,*)= inf F($ T ®id(|u)(v|),tf T ®id(|u)(i;|)). 

veg®i%;\\v\\=l 

Using this expression in conjunction with Eq. Q, we get the bounds 

/($,*) > 1- sup D(* T ®id(|v)(v|),* T ®id(|u)(i;|)) (56) 
veg®e%:\\v\\=i 

/ 2 ($,*) < 1- sup D 2 ($ T (g>id{\v)(v\) -* T ®id(|u)(u|)). (57) 

Now, for any completely bounded map A : £>(h) — ► B(g), the image of the set {|i>)(i>| : 
v £ £% \\v\\ = 1} under the predual map A T <g> id : B t (q <g> £%) — > £> T (f) (g) ^j) is 
contained in the trace-norm closure of the linear span of : ( G f) <8 -^q, ||£|| = 1}, 

which is dual to the tensor product B(t)) ® /C(£ 2 ), where JC(£ 2 ) is the space of compact 
operators on ^ 2 . Thus, by duality we have 

sup D($ T ® id(|u)(u|), * T ® id(|w)(u|)) 
■ueB®<o : IMI=i 

= ^ll(*-*)®idx:(<a)|| 

where denotes the CB-norm half-distance (1/2)||$ — *|| c b, and the last equality 

follows from the formula || A|| c b = || A ® idx.(p) || for any completely bounded map A. 

On the other hand, when dimg = rn < oo, we can use the fact |20| that, for any 
completely bounded map A into B(g), 

||A|| ch = ||A®idMj| = ||A T ®idx m || T , 

where A4 m denotes the algebra of rn x m complex matrices, whence it follows that 

sup D($ T ® l(|-u)(u|),* T ® t{\v){v\)) = £>($,*). 

In either case, we immediately derive the inequality 

1 - £>($, *) < *) < V 1 - £> 2 ($, (58) 
which, when expressed in terms of the Hellinger distance du(*,*) '■= v/l — /(•, •) as 

2- 1/2 P($, *) < d ff ($, *) < v/p(¥7*), (59) 
yields the desired property 

(f.5) the Hellinger distance <£#(•,•) := \fT— /(•, •) is topologically equivalent to 
the CB-norm distance [cf. Eq. ( I59H . 

This completes our survey of the basic properties of the minimax fidelity /. 



OPERATIONAL DISTANCE AND FIDELITY FOR QUANTUM CHANNELS 



21 



6. Some examples and applications 



The expressions for the minimax fidelity, derived in Section |4] for different kinds of 
quantum operations encountered in quantum information theory, share the common feature 
of being set up as variational problems, namely, as minimizations of a concave functional 
over a convex set. This feature of the minimax fidelity renders the problem of computing 
it amenable to robust numerical methods (see Ref. [3 1 for detailed discussion of numerical 
optimization methods for the calculation of fidelity-like measures in quantum information 
theory). However, there are instances in which the minimax fidelity between two quantum 
channels can be written down in a more explicit form. In this section we sketch some 
examples of such instances. 

Before we proceed, we would like to remind the reader of the assumption we made in 
Section [2~2l namely that all the channels we deal with are completely majorized by the 
trace in the sense of Ref. 1 14 1. This assumption, while allowing us to circumvent certain 
technicalities involving unbounded operators, is somewhat restrictive, as one can easily 
find examples of quantum channels between infinite-dimensional algebras (e.g., unitarily 
or isometrically implemented channels; see Ref. 1211 for details) that do not satisfy this 
condition of complete majorization. However, owing to the CB-continuity of the minimax 
fidelity [cf. Section [5], we may always regard such channels as CB-limits of sequences 
of channels with finite-dimensional output algebras. Thus, given a channel $ : B — > A, 
B = 6(h), A = B(q) with dimg = oo, we consider a sequence {P n } of finite-dimensional 
projections such that P n — > 1 B strongly, and the corresponding sequence {$„} of quantum 
operations := P n Q(B)P n , so that Q n (B) — > uniformly as n — > oo for each 

B e B, and each <!>„ is a channel from B into P n AP n , with lirrin^oo ||$ — <J? n || c b = 0. 

With this in mind, in the examples below we shall not worry about the issue of bounded 
vs. unbounded operational densities. 

6.1. Unitary maps. In the case of channels Qjj, @v implemented by the unitaries U, V : 
\) — ► h, i.e., Qu{B) = WBU and Q V (B) = V^BV, the minimax fidelity /($, is 
easily evaluated using Eq. d49b : 



where we have defined W :— U'V. Let Sp(M /r ) denote the spectrum of W, which is a 
closed compact subset of the unit circle T in the complex plane, and let E w (dz) denote 
the corresponding spectral measure of W. Then we can write 



where co Sp(M / ) denotes the closed convex hull of Sp(T4^), anddist(z, S) := m£{\z—z'\ : 
z' e S} for any z £ C and ScC. Clearly, f{Q Vl Q v ) = 1 if and only if coSpT^ C T, 
i.e., if and only if W — Alp, with |A| = 1, which is equivalent to Qjj = Qy, 

When dimf) < oo, Sp(W / ) is a finite subset of T, so that coSp(W^) is a polygon in 
the complex plane, and Eq. i60\ shows that f(&u> ©v) is simply the distance d from 
this polygon to the origin. On the other hand, recalling the formula [ 12 1 T>(Qjj, 0y) = 
Vl — d 2 , we see that the upper bound in Eq. J58I is saturated by the unitarily implemented 
channels. 



/(©£/, ©V) 




f(Qu,Qv) = inf / zM w < s (dz) , 
eeS( s ) Jsp(w) 

(dz) is the probability measure g[E w (dz)} = (E w (dz), p). Thus 




where M w ^ 



/(e a ,©y) = dist(0,coSp(W0), 



(60) 
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6.2. Random unitary channels. Continuing with the set-up from the preceding example, 
let us consider channels of the form 

=J2Pi @ Ui( B l -51*6^(5), (61) 

i i 

where 0^ are unitarily implemented channels and p = {pi}, q = {qi} are probability 
distributions. It the follows immediately from the strong concavity property d55i of the 
minimax fidelity that 

f^,^)>J2VPWi = F(p,q). (62) 



When dim \) < oo, the inequality in i62\ becomes equality when the unitaries Ui are 
orthogonal in the Hilbert-Schmidt sense, TrLfJ Uf. = dimrj • Sik- On the other hand, 
when f) is infinite-dimensional, this orthogonality condition does not make sense unless 
we consider channels given in terms of continual Rraus decompositions, so that the sums 
in Eq. i62\ are replaced with integrals with respect to some positive measure p, and agree 
to understand orthogonality in the sense of operator-valued Schwartz distributions. As an 
example, consider the following. 

Let \) = T, the boson Fock space, let a and a) be the field annihilation and cre- 
ation operators, and let D(z) :— exp(z(V — za), z S C, be the unitary displacement 
operators obeying the Weyl relation D(z)D(z') = e llmzz D(z + z'). Given a function 
/ G L 2 (C,dz), where dz := d(Re z)d(Imz), we define its Weyl-Fourier transform as 
D(f) := 7r~ 1//2 J c f(z)D(z)dz. Since / is square-integrable, D(f) is a Hilbert-Schmidt 
operator, and it can be easily shown that 



Tr [D(f)^D(g)] = / f(z)g(z)dz=(f,g) LH c), V/,jei 



2/ 



so that Tr [D(z)^D(z')] = irS^ ( z - z '), z, z' e C, where 6^ (A) := <5(Re X)S(Im A) is 
the Dirac (5-function in the complex plane. 

With this in mind, consider the family of channels : B(J-) — > (i e M + , 

with the preduals given by 

rW(p) :=— f D(z)pD(z)Uxp(-\z\ 2 /ti)dz 

(in quantum optics these channels model the so-called Gaussian displacement noise |24|). 
Then the minimax fidelity between and T^' is given by 

/(r w ir M )= HL. (63) 

Owing to the inequality between the geometric and the arithmetic means, the right-hand 
side of Eq. d63i is always bounded between and 1, and the maximum value of 1 is attained 
if and only if p — v, i.e., T 1 -^ — T^'. This, of course, agrees with the properties of the 
minimax fidelity (cf. Section[5}. 



6.3. Master equation. Consider a strongly continuous semigroup of channels {$w : 
B(\)) — > S(h)} te jj+, with the preduals $j satisfying the Lindblad master equation 1251 

d^jHp) = XpX] _ h x ^ Xp + pX ^ x) (64) 



dt 
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for some X € £>(fj)- Introducing the dilating Hilbert space 7i = t) ® f) with the basis 
{|0), |1), . . .}, we can, for an infinitesimal time t = e, write the predual of the channel 
3>( £ ) in the Stinespring form 

^(p) =Tr n A £ pAt, (65) 
where the map A e : h — > h (g) 7i is given by 



A e v := - 7^ f ^) « ® |0) + Ve-^w ® I 1 ) + C(e 2 ), 



(66) 



0(e 2 ) indicating terms with norm bounded from above by Me 2 for some constant M > 0. 
Note that Aqv — v ® |0), so that T(°5 = id. We can then evaluate the partial trace 

Tr„[A E p4]= M-I £ (XtX)^ |0)<0| + ViWp|l>(0| + O(e 2 ), (67) 

where (_B) p := Tr(Bp) for B 6 <B(h). Then, again up to an additive term of operator 
norm 0(e 2 ), 

TT^pA^Tv^ApAl}^ 



l-\e{X^X) p ) + :|,.V; ( 



><0|, (68) 



which allows us to compute, up to 0(e 2 ), the minimax fidelity between the channel 
after an infinitesimal time e and the identity map. Using Eq. J53I . we obtain 



/(T«,id)= inf ||Tr„[A e Mt]|| T « VT^TC, (69) 

where 

C= inf ((X^X) p -\(X) p \ 2 ). (70) 
e65(s) 

6.4. Impossibility of quantum bit commitment. The statement of topological equiv- 
alence of the noncommutative Hellinger distance and the CB-norm distance between a 
pair of quantum channels, i.e., Eq. i59\ , is essentially the "continuity argument" at the 
heart of a proof of "impossibility of quantum bit commitment (QBC)" I26I . Quantum bit 
commitment is a cryptographic objective in which one party, Alice, commits a bit to an- 
other party, Bob, in such a way that the corresponding protocol is concealing (i.e., Bob 
is not able to retrieve the bit before the opening) and binding (i.e., Alice cannot change 
the bit after the commitment). The impossibility proof asserts that if the protocol is per- 
fectly concealing, then it is necessarily not binding, and invokes a continuity argument 
for "asymptotically" concealing protocols, stating that Alice's probability of successful 
cheating approaces unity, while Bob's cheating probability becomes close to the value 1/2 
(pure guessing). 1 In this example we derive the continuity argument from the expression 
of Alice's and Bob's respective cheating probabilities as a consequence of the topological 
equivalence between the Hellinger distance and the CB-norm distance in Eq. i59i . 

From the point of view of Bob, Alice's action of committing the bit is equivalent to 
a channel $^(i>) on an algebra $(fj), dimt) < oo, for each value of the committed bit 
b = 0, 1, where = {Ay }* =1 is a collection of operators satisfying the Kraus condi- 
tion X)jLi ^ A^ = 1, and $^<b) denotes the channel induced by this Kraus decompo- 
sition. At the opening, Alice informs Bob about which element of the Kraus decomposition 



'The reader should be aware that the impossibility proof in Ref. 1261 is valid for a restricted class of protocols, 
i.e., those that are non-aborting and have a single commitment step. For wider classes of protocols, it is still a 
matter of debate whether a secure QBC protocol exists 1271 . 
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she actually used in the commitment. However, prior to unveiling the label j, Alice 
can perform an EPR attack with the purpose of changing the Kraus decomposition to an- 
other equivalent decomposition A {b) (V) = {Af\v)}, where Af\V) := ^i^Yjt 
for some V G U(C fe ). The EPR attack is achieved by Alice via the unitary transforma- 
tion V on an ancillary fc-dimensional space Ti. The conditional probability that Alice can 
cheat successfully by convincing Bob that she has committed, say, 6 = 1, while having 
successfully committed 6 = instead, is given by 

P*(V,v = V 3 ' 3 (71) 

T \\{Af®t H )v\\* 

where ® 1-h)u|| 2 is the probability that the jth Kraus element is unveiled. Which 

V should Alice use? Without any knowledge of \v), the best she can do is to adopt a 
conservative strategy of choosing the V that will maximize her cheating probability in 
the worst-case scenario, namely for the anonymous state \v) chosen by Bob to minimize 
P^{V,v). This is the minimax choice of V, corresponding to the cheating probability 

P^ := sup inf P?(V,v), (72) 

V£U(C fc ) v£i)®H;\\v\\ = l 

On the other hand, for equiprobable bit values 6 g {0, 1} Bob's optimal probability of 
cheating is given by the probability of error in discriminating between the corresponding 
output states, more precisely 

Pc=\ + \ sup -p\ w \\j = \[l+V& AWl * AW )], (73) 

where we have defined p v A :— & A ® id(|u) (v\). Using Jensen's inequality, we can bound 
Alice's cheating probability P^(V, v) from below as 



P C A (V»> 



Y t \(Mf\vydi ) ®ln\v) 



(74) 



Note that the value of the max-min in Eq. M2\ will not change if we perform the maxi- 
mization over the closed convex hull of U(C fe ), i.e., the set K(C fe ) of all linear contractions 
on C k , and the minimization over the closed convex hull of the pure states on t) <g) Ti, i.e., 
the set S(t) <£> Ti) of states on S(f) ® TL), thus completing the domain of the max-min to the 
product K(C fc ) x S(t) (S> Ti) of compact convex sets. Now, the functional 

F{V,p) := J2XeTr{p[A?\v?A?> ® t H }} (75) 

3 

is affine in both V € K(C k ) and p 6 5(f) ® Ti), so that we can use standard minimax 
arguments |23 1 to justify the interchange of extrema in Eq. (T72t . and then apply Lemma|2] 
to obtain 

sup inf \F{V,\v){v\)\ = sup inf \F(V,p)\ (76) 
V£U(c k ) vet >®n veK(c k ) ees(t)^H) 

= inf sup \F(V,p)\ (77) 

inf sup \F(V,\v)(v\)\. (78) 

v£l)®H;\\v\\=l y e u(fc) 
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Now, since a monotone function does not affect the saddle point, we can use Eqs. i50\ . 
(72), G3, and (78) to obtain 

P/>/ 2 ($ A(0 ,,$ A(1) ). 
Using Eq. d59l and then Eq. d73l >. we finally obtain the chain of estimates 

P C A > f($Aio),® A v) > [1 " »(^W)» ^w)f > [1- 2 (^<f ~ V2)] 2 , 

whence it follows that, for "asymptotically" concealing protocols, i.e., those for which 
P C B — > |, Alice's probability of cheating will approach unity, and the protocol will not be 
binding. 
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